Diversity in Ensembles for One-Class Classification
نویسنده
چکیده
09:00 – 10:30 Workshop on Mining Complex and Stream Data (MCSD 2012) Machine-generated Data Analytics: Challenges and Opportunities Graham Toppin (invited talk) SONCA. Scalable Semantic Processing of Rapidly Growing Document Stores Marek Grzegorowski, Przemysław Wiktor Pardel, Sebastian Stawicki, Krzysztof Stencel Soft competitive learning for large data sets Frank-Michael Schleif, Xibin Zhu, Barbara Hammer 10:30 – 11:00 coffee break
منابع مشابه
A High-Performance Model based on Ensembles for Twitter Sentiment Classification
Background and Objectives: Twitter Sentiment Classification is one of the most popular fields in information retrieval and text mining. Millions of people of the world intensity use social networks like Twitter. It supports users to publish tweets to tell what they are thinking about topics. There are numerous web sites built on the Internet presenting Twitter. The user can enter a sentiment ta...
متن کاملClass-Specific Ensembles for Active Learning in Digital Imagery
In many real-world tasks of image classification, limited amounts of labeled data are available to train automatic classifiers. Consequently, extensive human expert involvement is required for instance labeling. Detecting Egeria densa in digital imagery is one such real-world classification task. It presents an additional challenge due to subtle spectral changes in Egeria, which makes it diffic...
متن کاملModerate diversity for better cluster ensembles
Adjusted Rand index is used to measure diversity in cluster ensembles and a diversity measure is subsequently proposed. Although the measure was found to be related to the quality of the ensemble, this relationship appeared to be non-monotonic. In some cases, ensembles which exhibited a moderate level of diversity gave a more accurate clustering. Based on this, a procedure for building a cluste...
متن کاملEnsembles of (α)-Trees for Imbalanced Classification Problems
This paper introduces two kinds of decision tree ensembles for imbalanced classification problems, extensively utilizing properties of α-divergence. First, a novel splitting criterion based on α-divergence is shown to generalize several wellknown splitting criteria such as those used in C4.5 and CART. When the α-divergence splitting criterion is applied to imbalanced data, one can obtain decisi...
متن کاملMeasuring Diversity in Regression Ensembles
The problem of combining predictors to increase accuracy (often called ensemble learning) has been studied broadly in the machine learning community for both classification and regression tasks. The design of an ensemble is based on the individual accuracy of the predictors and also how different they are from one another. There is a significant body of literature on how to design and measure d...
متن کامل